For Visualization-Based Analysis Tools in Knowledge Discovery Process: A Multilayer Perceptron versus Principal Components Analysis: A Comparative Study

نویسندگان

  • Xavier Polanco
  • Claire François
  • Mohamed Aly Ould Louly
چکیده

Mapping knowledge structures is a key task in Knowledge Discovery in Databases (KDD). In order to display the thematic organization of knowledge, we compare and evaluate two different cartography approaches: principal components analysis (PCA) and a multilayer perceptron (MLP) in "self-association" mode. This kind of MLP can be used to perform a PCA when the activation function is set to the identity function. This allows us to look for the non-linear activation function which best fits the data structure. We present an evaluation criterion and the results and maps obtained with both methods. We notice that the MLP detects a non-linearity in the data structure that the PCA does not detect. However, the MLP does not express the non-linearity completely. Finally we show how a related component analysis (RCA), based on graph theory, provides representations of the inter-clusters relationships, compensating for the approximate nature of the maps, and improving their readability.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing an Ontology for Knowledge Discovery in Iran’s Vaccine

Ontology is a requirement engineering product and the key to knowledge discovery. It includes the terminology to describe a set of facts, assumptions, and relations with which the detailed meanings of vocabularies among communities can be determined. This is a qualitative content analysis research. This study has made use of ontology for the first time to discover the knowledge of vaccine in Ir...

متن کامل

Comparative Analysis of Classification Techniques in Data Mining Using Different Datasets

Data mining is the invention of knowledge and useful information from the large amounts of data stored in databases. It is referred as an analysis study of the Knowledge discovery in database process or KDD. Data mining tools are used in forecasting future trends and behaviours, allowing businesses to make proactive, knowledge-driven decisions. Classification is an important data mining techniq...

متن کامل

Modeling and analysis of leishmaniasis distribution process using multilayer perceptron neural network and support vector regression (Case study: villages of Isfahan province)

Villages located in Isfahan province are one of the areas prone to the spread of cutaneous leishmaniasis, which is characterized by the occurrence of wounds on the skin. To predict the future prevalence of cutaneous leishmaniasis, Continuous monitoring of the spatial distribution of this disease is essential. Disease modeling was performed using two machine learning algorithms called support ve...

متن کامل

Tracking of Doubtful Real Estate Transactions by Outlier Detection Methods: a Comparative Study

Doubtful real estate transactions, with the prices far away from the market prices, appear because of non commercial transactions or efforts in order to hide the taxes. To estimate the right values of parameters, such data must be removed from a data set or robust methods of parameters estimation are to be used, while developing a mass appraisal model. Such transactions are outlying observation...

متن کامل

Knowledge discovery using neural approach for SME's credit risk analysis problem in Turkey

This study proposes a knowledge discovery method that uses multilayer perceptron (MLP) based neural rule extraction (NRE) approach for credit risk analysis (CRA) of real-life small and medium enterprises (SMEs) in Turkey. A feature selection and extraction stage is followed by neural classification that produces accurate rule sets. In the first stage, the feature selection is achieved by decisi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998